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Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims in 
the application: 

Cancel claims 1-42 without prejudice. 

Add the following new claims 43-46, as follows. 

Listing of Claims: 
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43. (new) A method for extracting records from a structured text in 
a computer system, comprising: 

identifying potential locations of values of record fields in the text by 
identifying locations in the text of items in lists of known potential values 
for record fields, 

identifying a region of interest in the text by applying multiple 
candidate region partitioners, evaluating each to measure how well it 
isolates a region with a high density and a high amount of potential 
locations of values of record fields, selecting one that measures best, and 
applying it to produce a region of interest, 

segmenting the region of interest into record regions that each 
contain data for a single record by applying multiple candidate 
segmenters, evaluating each to measure how well it segments into 
regions such that each region has one field value per record field and 
such that different regions have similar numbers of field values for each 
record field, selecting one that measures best, applying it to produce 
record regions, extracting field values from record regions by identifying 
most likely locations of field values for each record field in each record 
region, and 

outputting records composed of extracted field values for record 

fields. 

44. (new) The method of claim 43, with the addition of: 
identifying potential locations of values of record fields in the text by 

identifying locations in the text of patterns of potential values for record 
fields. 

45. (new) The method of claim 43, with the addition of: 
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identifying potential locations of values of record fields in the text by 
identifying locations in the text of numbers in ranges that are potential 
values for record fields. 

46. (new) An apparatus for extracting data from a file, comprising a 
computer and a computer program, performed by the computer, for: 

identifying potential locations of values of record fields in the text by 
identifying locations in the text of items in lists of known potential values 
for record fields, 

identifying a region of interest in the text by applying multiple 
candidate region partitioned, evaluating each to measure how well it 
isolates a region with a high density and a high amount of potential 
locations of values of record fields, selecting one that measures best, and 
applying it to produce a region of interest, 

segmenting the region of interest into record regions that each 
contain data for a single record by applying multiple candidate 
segmenters, evaluating each to measure how well it segments into 
regions such that each region has one field value per record field and 
such that different regions have similar numbers of field values for each 
record field, selecting one that measures best, applying it to produce 
record regions, 

extracting field values from record regions by identifying most likely 
locations of field values for each record field in each record region, and 
outputting records composed of extracted field values for record 

fields. 
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